Title: Speaker Adaptation of Hidden Markov Models Using Maximum Likelihood Linear Regression. Author: Supervisors

نویسنده

Heidi Christensen

چکیده

Material and results from the current thesis may be used freely provided that the source is stated. Abstract The work presented in this report focuses on an essential problem when doing speaker adaptation; namely how eeectively the speaker speciic information in the adaptation data is used. In the project a system has been implemented for speaker adaptation of hidden Markov models (HMM's) using the Maximum Likelihood Linear Regression (MLLR) method. MLLR is a method that transforms mixture components of HMM's by multiplying the mean vectors with a transformation matrix. It introduces the concept of regression classes as a set of mixture components that are transformed similarly. The adaptation technique is implemented in C. The data used in the tests are taken from the Danish EU-ROM.1 database. All results are averaged over ten speakers. Three issues have been addressed: 1) the eeect of varying the amount of adaptation material, 2) the eeect of using diierent regression class divisions and 3) the importance of the phonetic content in the adaptation material. Tests show that the MLLR technique is very data eeective. Only 3s of speech is needed when a diagonal transformation matrix is used before a positive eeect of the adaptation is seen. When using a full matrix 5s are suucient. It was observed that there is a high dependence between the achieved performance and the regression class division. Transforming each mono-phone individually improves the phoneme cor-rectness from 50.9% for the initial speaker independent models to 58.8% for the adapted models. Based on several approaches it was concluded that there are diierences in the speaker speciic information available from diierent phonemes. Vowels were seen to vary a lot from speaker to speaker and to have relatively much innuence on the eeect of the adaptation. Fricatives on the other hand have very small inter speaker variances. The project addresses the problem of speaker adaptation of hidden Markov models using the Maximum Likelihood Linear Regression technique. Some prior knowledge of the theory of Markov models is assumed. The report is divided into four parts. The Preliminaries part contains the introductory sections including the deenition of the project. In the Theory part theory, techniques and methods are described. The implementation and tests are described in the Implementation, test and conclusion part. The last part of the report is the Appendix where additional information is given. Developed software is included on a oppy disk. References to …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Speaker Adaptation of an Acoustic Model

This paper deals with several adaptation techniques, which are of the importance in cases when the identity of a speaker is known and we want to recognize his speech. We are using three different methods, namely Maximum Apriori Probability adaptation, Maximum Likelihood Linear Regression and Constrained Maximum Likelihood Linear Regression. Each of the methods yields various benefits, therefore...

متن کامل

Improvement of MLLR Speaker Adaptation Using a Novel Method

This paper presents a technical speaker adaptation method called WMLLR, which is based on maximum likelihood linear regression (MLLR). In MLLR, a linear regression-based transform which adapted the HMM mean vectors was calculated to maximize the likelihood of adaptation data. In this paper, the prior knowledge of the initial model is adequately incorporated into the adaptation. A series of spea...

متن کامل

Adaptation of acoustic models for multilingual recognition

This paper evaluates the recognition performance of a system using acoustic models transformed across language boundaries. Parameters of hidden Markov models (HMMs) trained on speaker independent English data are adapted using Afrikaans adaptation data to realise speaker dependent, multispeaker and speaker independent Afrikaans models. Adaptation is performed using maximum a posteriori probabil...

متن کامل

Quasi-Bayes linear regression for sequential learning of hidden Markov models

This paper presents an online/sequential linear regression adaptation framework for hidden Markov model (HMM) based speech recognition. Our attempt is to sequentially improve speaker-independent speech recognition system to handle the nonstationary environments via the linear regression adaptation of HMMs. A quasi-Bayes linear regression (QBLR) algorithm is developed to execute the sequential a...

متن کامل

Discriminative speaker adaptation with conditional maximum likelihood linear regression

We present a simplified derivation of the extended Baum-Welch procedure, which shows that it can be used for Maximum Mutual Information (MMI) of a large class of continuous emission density hidden Markov models (HMMs). We use the extended Baum-Welch procedure for discriminative estimation of MLLR-type speaker adaptation transformations. The resulting adaptation procedure, termed Conditional Max...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1996

Title: Speaker Adaptation of Hidden Markov Models Using Maximum Likelihood Linear Regression. Author: Supervisors

نویسنده

چکیده

منابع مشابه

The Speaker Adaptation of an Acoustic Model

Improvement of MLLR Speaker Adaptation Using a Novel Method

Adaptation of acoustic models for multilingual recognition

Quasi-Bayes linear regression for sequential learning of hidden Markov models

Discriminative speaker adaptation with conditional maximum likelihood linear regression

عنوان ژورنال:

اشتراک گذاری